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Phase transitions and all that 


Gabriel Istrate* 


1 Introduction 

Since the experimental paper of Cheeseman, Kanefsky and Taylor |(T) phase transitions in combinatorial 
problems held the promise to shed light on the “practical” algorithmic complexity of combinatorial problems. 
However, the connection conjectured in |[T) was easily seen to be inaccurate. A much more realistic possible 
connection has been highlighted by the results (based on experimental evidence and nonrigorous arguments 
from Statistical Mechanics) of Monasson et al. [j2j| (see also |3)). These results supported the conjecture that 
it is first-order phase transitions that have algorithmic implications for the complexity of restricted classes 
of algorithms, including the important class of Davis-Putnam-Longman-Loveland (DPLL) algorithms (4). 

There exists, indeed, a nonrigorous argument supporting this conjecture: phase transitions amount to 
nonanalytical behavior of a certain order parameter, the phase transition is first order if the order parameter is 
actually discontinuous. At least for random k-S AT |[5j the order parameter suggested by Statistical Mechanics 
considerations has a purely combinatorial interpretation: it is the backbone of the formula, the set of literals 
that assume the same value in all optimal assignments. But intuitively one can relate (see e.g. the presentation 
of this argument by Achlioptas, Beame and Molloy 13) the size of the backbone to the complexity of DPLL 
algorithms, when run on random k- SAT instances slightly above the phase transition: All literals in the 
backbone require well-defined values in order to satisfy the formula. But a DPLL algorithm has very few 
ways to know what those “right” values are. If w.h.p. the backbone of formulas above the transition contains 
a positive fraction of the literals that is bounded away from zero as we approach the transition (which happens 
in a case of a first-order phase transition) then, intuitively, DPLL will nrisassign a variable having Q (n) height 
in the Lee representing the behavior of the algorithm, and will be forced to backtrack on the given variable. 
The conclusion of this intuitive argument is that a first-order phase transition implies a lower bound for 
the running time of any DPLL algorithm, valid with high probability for random instances located slightly 
above the transition. 

While previous rigorous results 0113, supported these intuitions, to date, the extent of a connection 
between first-order phase transitions and algorithmic complexity was unclear. 

The goals of this paper are 

1. To remedy this, and formally establish a connection between first-order phase transitions and 
the resolution complexity of random satisfiability problems, and 

2. To take steps towards obtaining a complete classification of the order of phase transition in gen¬ 
eralized satisfiability problems. 

To accomplish these goals 
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1. we obtain (Theorem Q a complete characterization of sharp/coarse thresholds in the random general¬ 
ized satisfiability model due to Molloy da. “Phyisical” arguments (see discussion below) imply that 
it makes no sense to study the order of the phase transition unless the problem has a sharp threshold. 

2. we rigorously prove (Theorem |2j» that random 3-SAT has a first-order phase transition. We extend this 
result in several ways: first (Theorem |6ji to random (2 + p) -satisfiability, the original problem from 
0, obtaining further theoretical support to the heuristic results of £D. Second we give a sufficient 
condition (Theorem [4j for the existence of a first-order phase transition. We then show (Theorem |5} 
that all problems whose constraints have no implicates of size at most two satisfy this condition. 

3. we show that in all the cases where we can prove the existence of a first-order phase transition, such 
problems have a 2' l(n> lower bound on their resolution complexity (and hence the complexity of DPLL 
algorithms as well j4j). Indeed, the two phenomena (2- l<n) resolution complexity and the existence of 
a first-order phase transition) have common causes. 

4. in contrast, we show (Theorem 0 that, for any generalized satisfiability problem, a second-order phase 
transition implies, for every a > 0, a 0(2 a ' n ) upper bound on the resolution complexity of their 
random instances (in the region where most formulas are unsatisfiable). 


2 Preliminaries 

Throughout the paper we will assume familiarity with the general concepts of phase transitions in combina¬ 
torial problems (see e.g. (TT|), random structures 112H . proof complexity 1131 . Some papers whose concepts 
and methods we use in detail (and we assume greater familiarity with) include ei, Da, ca. 

Consider a monotonically increasing problem A = (A n ), under the constant probability model T(n,p), 
that independently sets to 1 with probability p each bit of the random string. As usual, for e > 0 let p e = p e (n) 
define the canonical probability such that Prob ;rGr ( ri J)e ( n ))[a: € A] = e. 

The probability that a random sample x satisfies property A (i.e. x £ A) is a monotonically increasing 
function of p. Sharp thresholds are those for which this function has a “sudden jump” from value 0 to 1: 

Definition 1 Problem A has a sharp threshold iff for every 0 < e < 1/2, we have lim n ^ l0O = 0- 

A has a coarse threshold if for some e > 0 it holds that lim, woo > 0. 

For satisfiability problems (whose complements are monotonically increasing) the constant probability 
model amounts to adding every constraint (among those allowed by the syntactic specification of the model) 
to the random formula independently with probability p. Related definitions can be given for the other two 
models for generating random structures, the counting model and the multiset model ca Under reasonable 
conditions ca these models are equivalent, and we will liberally switch between them. In particular, for 
satisfiability problem A, and an instance 4> of A, ca (<b) will denote its constraint density, the ratio between 
the number of clauses and the number of variables of 4>. To specify the random model in this latter cases we 
have to specify the constraint density as a function of n, the number of variables. We will use ca to denote 
the value of the constraint density c^($) (in the counting/multiset models) corresponding to taking p = P 1/2 
in the constant probability model, ca is a function on n that is believed to tend to a constant limit as n —r oo. 
However, Friedgut’s proof fl4l of a shaip threshold in £>SAT (and our results) leave this issue open. 

The original investigation of the order of the phase transition in A:-SAT used an order parameter called the 
backbone. Bollobas et al. (8) have investigated the order of the phase transition in 2-SAT under a different 
order parameter, a “monotonic version” of the backbone called the spine. 




Spme(<3?) = {x G Lz£|(3)H C <J>, E E <SAT, S A {x} E SAT}. 


( 1 ) 


They showed that random 2-SAT has a continuous (second-order) phase transition: the size of the spine, 
normalized by dividing it by the number of variables, approaches zero ( as n —> oo) for c < c^-sat = 1, 
and is continuous at c = c- 2 -sat- By contrast, nonrigorous arguments from Statistical Mechanics ( 5 j imply 
the fact that for 3 — SAT the spine jumps discontinuously from zero to positive values at the transition point 
c = cs-sat (a first-order phase transition). 

It is easy to see that the intuition concerning the connection between the complexity of DPLL algorithms 
and the size of the backbone (discussed briefly in the introduction) extends to the spine as well. In this 
paper whenever we will discuss the order of a phase transition we will do it with respect to this latter order 
parameter. 

We would like to obtain a complete classification of the order of the phase transition in random satis¬ 
fiability problems. A preliminary problem we have to deal with is characterizing those problems that have 
a sharp threshold: indeed. Physics considerations require that, in order that the study of the (order of the) 
phase transition to be meaningful, the order parameter (in the case of A:-SAT the spine) has to be, w.h.p., 
concentrated around its expected value (in Physics parlance it is a self averaging quantity), and it is zero up 
to a certain value of the control parameter (in our case constraint density c) and positive above it. In the case 
of A:-S AT these conditions imply the fact that A;-SAT has a sharp threshold. The argument (a “folklore” one) 
can be formally expressed by the following 

Lemma 2.1 Let c be an arbitrary constant value for the constraint density function. 

1. If c< ljm n ^oo c k-SAT{ri) then lim, woo l g ^ e ( $ )l = o. 

2. If for some c there exists 6 > 0 such that w.h.p. (as n —► oo) \ s P m A®)\ > § p ien Prob[<f> E 

SAT ] = 0, that is c > lim n -> 00 Ck_sAT{ n )- 

The argument is generic enough to extend to all constraint satisfaction problems. So a necessary condi¬ 
tion for the study of the phase transition to be meaningful is that the problem have a sharp threshold. 


3 Coarse and sharp thresholds of random generalized satisfiability problems 

In this section we obtain a complete classification of thresholds of random satisfiability problems, under 
Molloy’s recent model of random constraint satisfaction problems from m (specialized to satisfiability 
problems, i.e. problems with domain {0,1}). This affirmatively solves an open problem raised in fTT! . 

ofc 

Definition 2 Consider the set of all 2 Z —1 potential nonempty binary constraints on k variables X \,..., Xf.. 
We specify a probability distribution V which selects a single random constraint, and let C = supp(V ) be 
the set of constraints on which V assigns positive probability. 

A random formula from SAT n m{"P) A specified by the following procedure: 

• n is the number of variables. 

• M is the number of clauses, chosen by the following procedure: first select, uniformly at random and 
with repetition m hyperedges of the uniform hypergraph on n variables. 

• for each hyperedge choose a random ordering of the variables involved. Choose a random constraint 
according to V and apply it on the list of (ordered) variables. 





SAT(C ) refers to the random model corresponding to V being the uniform distribution on C. 

It turns out we face a technical difficulty when studying sharp and coarse thresholds in Molloy’s model; 
it cannot be directly mapped onto the constant probability model for which the notion of a sharp threshold in 
Definition [T| works. The definition of a shaip threshold we need to employ is the one from ffol 

Definition 3 SAT(V) is said to have a sharp threshold of satisfiability if there exists a function c(n) bounded 
away from 0 such that, for any e > 0 if M < ( c{n) — e)n then SATn^i'P) is «• s. satisfiable and if 
M > ( c{n ) + e)n then SATn^M^P) is a.s. unsatisfiable. On the other hand, if there exist two functions 
(n), M 2 (n) such that Mi(n)/M 2 (n) is bounded away from zero, and the satisfaction probability of ran¬ 
dom instances from SAT n ^ (V), SAT H) m 2 (P) is bounded away from both 0 and 1 then SAT(V) is said to 
have a coarse threshold. 


However, just as in da (where this was done in the case when V is the uniform distribution), for k > 3 
one can map Molloy’s model onto a modified version of the constant probability model, defined as follows: 
Let pi, ... p r be positive numbers between zero and 1. A random sample x from the model T pl ,... Pr (n,p) is 
obtained in the following way: divide the bits of x into r equal groups. Set each of the bits in the i’th group 
to 1 independently with probability p ■ pi. For this model the definitions of p e and sharp/coarse threshold 
from Definition Scarry over, and are equivalent to those from Definition^ 

Indeed, let r be the cardinality of the support of distribution V, and pi,. .. ,p r be the associated positive 
probabilities. 

In its general setting Molloy’s model is specified as follows: divide the potential constraints into groups 
of rk\ constraints, corresponding to all possible applications of the r constraint templates on a fixed set of k 
variables. For each such group, independently with probability p, we make the decision to include at least 
one of the constraints in the group with probability p = (going from M clauses to including each 

potential edge independently with probability p can be done just as in the uniform case from my 

Each realization of constraint constraint template i is chosen with probability probability pi/k\. Denote 
this model by M(n,p,pi,... ,p r ). 

Defining f(x) = [(1 + xppi/kl) ■ (1 + xpp2/k\) •...•(! + xpp r )/k\] k ' — x we have /(1) > 0 and, 
since (by a simple calculus argument) the minimum of f(x) over the choices of p % > 0, ^ p, = 1 is obtained 
when one of them is 1 and the others are zero, 


/( 


1 — p 


)>[! + 


P 


1 k\ 


fc!(l — p) 


1 

1 — p 


> 0 . 


Let a = a(ri) > 0 be the smallest solution of the equation /(«) = 0. Thus < a. 
Define, for i = 1, r, 

, _ 1/fc! - a- pi 

^ 1 1 + 1 /k\ ■ appi 


Claim 1 The following hold for any p = 0(n { k ): 


1. For every formula <h such that no two constraints on the same set of variables appear in it, 

Prob (d>) > Prob [<&]. 

M {n,p,p 1 ,...,p r ) r P (,... ,p' r ( n >P) 


Consequently 


Prob [SAT(V)\ > Prob [SAT{V)\ 

M(n,p,pi,...,p r ) r P j, ■ ■ ■ ,p' r ( n ’P) 

no two constraints on the same set of variables appear in $]. 







2. On the other hand, there exists f(n ) = 1 + o(l) such that for every formula <5 such that no two 
constraints on the same set of variables appear in it, 

Prob (<&) < f(n) Prob [$]. 

M(n,p,pi,...,pr) T p ! _ p!r (■ n,p ) 

Consequently 


Prob [SAT(V)\ < (1 + o(l)) Prob [SAT(V)\ 

M(n,p,pi,...,p r ) r p / _... }P ' r (n,p) 

no two constraints on the same set of variables appear in $]. 


Indeed, consider the set of constraints on a fixed set of given variables. The probability (under T p > iP >) 

that a given clause of type i is included, and none of the others arc is equal to . pl>/ , ■ [(1 —pp \)... (1— pp' r )] k '. 

PP { 

But 


w'i 

1 - pp\ 


appi/kl. 


Also 


i -vv\ 


1 

1 + appi/kl ’ 


so, by the definition of a, 


[(i-pp' 1 )...(i-p p ' r r=± 

a 

This means that the probability that in a given set of constraints exactly one constraint (of type i) is 
chosen is equal to ppi/k\ , the same as in model M. On the other hand the probability that no constraint 
is chosen is equal to [(1 — pp \)... (1 — pp' r )] kl = j~- But the same probability in model M is 1 — p, and 
we know that 1 — p > j~- In both model decisions on different sets of k variables arc independent. The 
conclusion is that M assigns a larger probability than T p / ^ to any sample x to which it assigns positive 
probability. Point (1) follows. 

Point (2) has a similar proof: by calculus the maximum value of f(x) is obtained when the pf s arc equal, 
so 

0 = f{a) < (1 + ^ ) rk ! - a < e ap - a. 
rk\ 

Since p = o(l), for lai'ge enough n e ap < 1 + ap + ( ap ) 2 , so ( ap ) 2 + a(p — 1) + 1 > 0, in other words 

a( 1 — p) < 1 + (ap) 2 . 


But the ratio of the probabilities associated to any given '!> by M and r p / ...y r (n,p) verifies 


Pl'ObM( n,p,pi,...,p r ) (^ > ) 

Pr ° b V 1 ,...^(n, P )[ $ ] 


< [a(l -p)] rfe! (fc) < (1 + (ap) 2 ) rfe! (fc). 


Since p = 6(n l k ) and k > 3 the right-hand side is a function of n that is 1 + o(l). 

To prove the result it is enough to observe that for k > 3 the expected number of times a random formula 
in r p' 1 ,...,p , r (n,p) contains two different clauses on the same set of variables is o(l), since that will imply that 
the satisfaction probabilities in the two models arc related by a 1 — o(l) factor. Indeed, this number is 


fn\ ^PPaPPp, 

w ' ^ (^) 2 J ’ 



where indices a, (3 span the set of different pairs of clauses from a group. Since each group is finite 
(contains rkl clauses) and p = 9(v}^ k ). this expected value is 9{n 2 ~ k ), which is o(l) for k > 3. □ 


Definition 4 Constraint C‘> is an implicate of C\ iff every satisfying assignment for C\ satisfies C‘>. 

Definition 5 A boolean constraint C strongly depends on a literal if it has an unit clause as an implicate. 

Definition 6 A boolean constraint C strongly depends on a 2-XOR relation if 3i, j £ \,k such that con¬ 
straint “xi f Xj” is an implicate ofC. 

Our result is: 

Theorem 1 Consider a generalized satisfiability problem SAT(V) (that is not trivially satisfiable by the 
“all zeros” or “all ones” assignment). Let C = suppifP). 

1. if some constrain t in C strongly depends on one component then SAT(fP) has a coarse threshold. 

2. if some constraint in C strongly depends on a 2XOR-relation then SAT(V ) has a coarse threshold. 

3. in all other cases SAT(V) has a sharp threshold. 

Proof. 


1. Suppose some clause C implies a unit clause. We claim that SAT{V) has a coarse threshold in the 
region where the expected number of clauses is 9{ffn). 

That the probability that such a formula is bounded away from zero in this region it is easy to see: 
consider a random formula with cffn constraints, and let H be the /^-uniform formula hypergraph. 
The expected number of pairs of edges Ci, Cj that have nonempty intersection is 



© “ ^ 


Therefore with constant positive probability (that depends on c), all vertices will have degree at most 
1 in the hypergraph H n , and the formula will be satisfiable. 

If, on the other hand, both positive and negative unit clauses are implicates of constraints in V then one 
can adapt the well-known lower bound on the probability of intersection of two random sets of size 
0(y/n) to show that, with constant probability a random formula will contain two contradictory unit 
clauses as implicates, and be unsatisfiable. 

The proof is similar in the case when only one type of unit clauses (w.l.o.g assume it’s the positive 
unit clauses) are implicates of constraints in C. Since SAT(C) is not trivial there exists a constraint 
Ci £ C with an implicate of the type x% V ... V xy,, b > 2. We deal first with the case when there 
exists a constraint C 2 f C\ such that C 2 has an unit clause as implicate. Then it is easy to construct 
a formula F consisting of b copies of C\ and one copy of C 2 that implies the (unsatisfiable) formula 
{xi,..., Xb ,TT V ... V W,}- It is easy to see that the expected number of copies of F in a random 
instance of SAT(V) with O(ffn) clauses is constant, so the probability that the instance is unsatisfiable 
is bounded away from zero. 

Finally, in the case when the only constraint in C that has an unit implicate is C\. In this case one can 
use a trick similar to that used in the last paragraph of subsection l3.3l we use half of the random copies 
of Ci to imply (random) unit clauses, and the other half to imply (random) copies of xi V ... V Iff- 
This way we can produce, with constant probability, a copy of the formula F. 




2. Suppose now that C does not fall in the first case but has a 2XOR implicate. In this case Creignou 
and Daude have shown when V is the uniform distribution (and this extends directly to the case of a 
general probability distribution as well) that p 1 / 2 = 12 (n 1 l '")) and the expected number of constraints 
is On. Let csat(V) ' 71 be the expected number of constraints corresponding to p\/ 2 - Then there exists 
6 > 0 such that, for every n, eg at(V) = c SAT(V )( n ) > $■ 

Let us consider a random formula with c ■ n of constraints. By the well known result on triangles 
in random graphs it follows that with positive probability one can use C to create a “contradictory 
triangle”. Therefore it is easy to see that for every c > 0 the satisfaction probability is bounded away 
from 1. It is easy to see than this statement, together with the fact that cg^ T ^(n) > 5 together imply 
that SAT(V) has a coarse threshold. 

3. We will concentrate in the sequel on the last one. As discussed previously, for k > 3 Molloy’s model 
can be mapped onto a version of the constraint probability model. In the case k = 2 we can establish 
the existence of a sharp threshold in a direct manner, by the same method as the one used by Chvatal 
and Reed for 2-SAT iflSl (the complete proof of this case will be presented in the full version). Indeed, 
by the first two points of the Theorem, and the assumption k = 2 the only possible constraints in V 
can be constraints x V y, x V y, x V y, x V y, x = y, and the first two are always present. 

Let us now consider the case k > 3, using the modified version of the constant probability model. We 
note first that there exists a simple observation that allows us to reduce the problem to the case when 
V is the uniform probability: the Friedgut-Bourgain result on sharp/coarse threshold properties in 
monotone problems OH uses the following result, an easy consequence of the Mean Value Theorem: 
if a monotonic property A does not have a shaip threshold (under model Y(n.p)) then there exists 
p* = p( Ti ) an d a constant C > 0 such that (for infinitely many n) 

P* ■ I(p*) < C, (2) 


where I(p*) = \ p=p ». 

The same argument works when A is considered under model T Pl .. Pr (n,p). Moreover, it is an easy 
consequence of Russo’s Lemma for r piv .. Pr (n,p) that if equation Q holds for p* and some tuple 
pi,.. .p r , then it also holds (with a different constant C) for p* and tuple p\ = ... p r = 1/r. In 
other words it is enough to obtain a contradiction to the assumption that SAT(V) did not have a sharp 
threshold in the case when V is the uniform probability, which is what we show next. 

3.1 A base case 

To prove the theorem in the uniform case we will first consider a “base case” that is easier to explain, 
and will be of use in solving the general case: let a, b be two integers (not necessarily equal), both 
greater or equal to 2. Let S' be a set consisting of two constraints C\. C 2 of arity a, respectively b, 
specified by C\ = aff V .. .x^, C 2 = x\ V ... xj,. One can represent SAT(S) in the framework of 
Definition |3 by “simulating” C\, C 2 by suitable constraints of arity max{a, b}. 

We first outline how to prove that SAT(S) has a sharp threshold: we apply the Friedgut-Bourgain 
result d and infer that if SAT(S') did not have a sharp threshold than, for some 6, 1 ) 0 ? A 7> 0 and 
some probability p = p(n ) E \p e ,pi- e ] 

(a) either Prob p=p ( n )[<3? contains some F E SAT with |F| < K] > Sq, or 

(b) there exists a fixed satisfiable formula Fq, Fo| < K such that Prob p=p ( n ) [<f> A Fn E SAT] — 
Prob[<I> E FAT] > 5 0 . 






One easy observation is that in the second alternative we can always assume that F consists of a con¬ 
junction of unit clauses: if F is satisfiable and satisfies (2), then so does the conjunction of unit clauses 
specifying one satisfying assignment of F. The first alternative is eliminated by a result (Proposition 
4.6) from fl7l . The key to disproving the second alternative, in the case of fc-SAT, is a geometric result, 
Lemma 5.7 in d- We restate it here for completeness. 

Lemma 3.1 For a sequence A = (A n ) of subsets of the n-dimensional hypercube, A C {0, l} n , 
define A to be (d, m. recoverable if the probability for a union of a random choice of d subcubes 
(hyperplanes) of codimension m to cover A is greater than efor large enough n. 

Let f{n ) be any function that tends to infinity as n oo. For fixed k, d, and e any A that is (d, 1, r)- 
coverable is ( f{n ), k, e)-coverable. 

The connection with satisfiability can be explained as follows: the set A in the application of the 
Lemma 13.11 will (intuitively) refer to the set of satisfying assignments of random formula <I>. Hy¬ 
perplanes of codimension 1 are associated to unit clauses, more precisely to the set of assignments 
forbidden by a given unit clause. The fact that A can be covered with probability e by a union of d 
random hyperplanes of codimension 1 parallels the fact that with probability e, <F A Fq becomes un- 
satisfiable. This is what the result of Friedgut-Bourgain gives us (for r = do, under the assumption 
that fc-SAT does not have a sharp threshold). Hyperplanes of codimension k correspond to the set of 
assignments forbidden by a given /c-clause, and the conclusion of the geometric lemma is that adding 
any small (but unbounded) number f{n) of random /c-CNF clauses to random formula <f> boosts the 
probability of not being satisfiable at least as much as the addition of the (constantly many) unit clauses 
in F 0 . 

For small enough f(n) this statement can be directly refuted, by concentration results for the binomial 
distribution (Lemma 5.6 in fl4K A simpler and more general way to derive it is given as Lemma 3.1 

infI91. 

The same outline works for the case we consider. To state the geometric result we need, however, to 
work with two types of hyperplanes: 

Definition 7 Let H n = {0, l} n be the n-dimensional hypercube, and let Wi denote the value ofthei’th 
bit of element w E //,,. A positive hyperplane of codimension d is a subset of points of H n defined by a 
system of equations x tl = ... = x, d = 1, where the xfis are distinct variables. Negative hyperplanes 
have a similar definition. 

Our version of the geometric Lemma is 

Lemma 3.2 For a sequence A = ( A n ) of subsets of the n-dimensional hypercube, A n C {(), 1}" 
define A to be (m, d \ yfi- m \, rn -2 , recoverable if the probability of a union of a random choice of d \ 
negative hyperplanes of codimension m\ and d 2 positive hyperplanes of codimension m >2 to cover A n 
is at least e if n > n±. Let f(n), g(n) be any functions that tends to infinity as n —> oo. For fixed 
k\,k 2 , d, and 0 < 6 < e, there exists n 2 that depends on k\, k 2 , d, e, A, ri\ (but not A) such that for 
any n > n 2 any A n C {0, l} n that is (ri\ ,d\, d. 2 , 1,1 ,e)-coverable is (n- 2 , f(n),g(n), k\, k’ 2 , e — 5)- 
coverable. 


We will in fact prove a stronger version of the Lemma: 


Lemma 3.3 For a sequence A = (A n ) of subsets of the n-dimensional hypercube, assume that 


Prob[A C Hi U ... H d ] > e 


for all n > n\, where the Hi’s are random hyperplanes of codimension 1, d\ of them negative, d -2 of 
them positive. 

Let f(n), g(n) be any functions that tends to infinity as n —> oo. For fixed k\,k, 2 , d, and S > 0, there 
exists n* = n(n\,di,d 2 ,k\,k 2 ,e, 6, /, g), (however it does not depend on A) such that for any n > n* 
and any i, 0 < i < d 

id 

Pr[A CPjU...U Pi fM U ATi... U N igM U H i+1 U ... U H d \ > Pr[A C H x U ... U H d ] - —, (3) 

d d CL 

where the Nfis are random negative hyperplanes of codimension k\ and the Pi’s are random positive 
hyperplanes of codimension h>. 

Proof. 

It is easy to see that one can assume that d\f(n), d\g(n ) (since it is enough to prove the lemma for 
70 ) = d[f(n)/d\, g{n) = d[g(n)/d\). 

We will prove the lemma by double induction on d \. d^. By symmetry we only need to consider two 
“base cases:” 


Case 1: d\ = 0, d 2 = 1 


In this case (and the dual, d\ = 1, d -2 = 0) we can replace, for i = 1, equation EJby the stronger: 

ProbO CP 1 U...U P f{n) U JVi... U N g{n) ] >1-6. (4) 

The hypothesis implies that for n > n\ there exist n ■ e positive hyperplanes of codimension 1 such 
that 

An c p[ n) n ... n p™. 

We will assume, w.l.o.g., in what follows that A is in fact equal to the right hand side. If KP is a 
random positive hyperplane of codimension k\ then 


Prob [A g KP] = 1 - 


&•) 

0 ' 


S n ) 




Indeed, suppose KP is specified by the (random set of) equations x\ = ... = x K ki = 1. The condition 
that A C KP is equivalent to 

where {p, ,..., \ are Ihe literals that specify the hypeiplanes P ,,..., PpJ. 

Thus the probability that A is included in the union of g(n) random positive hyperplanes KP, of 
codimension k\ is at least 1 — Prob [(Vi) : A f. KP,], which is equal to 

(n-e\ 

1 - (1 - Y^y) ff(n) ~ 1 — (1 — e kl ) 9(n) —> 1 as 77. —> oo. 

(fcl) 




It follows that there exists n* = n(n\, di,d, 2 , k\ , ^ 2 , e, /, <?) such that for n > n* the left-hand side 
is larger than 1 — 5. 

Case 2: di + c ?2 > 1 

It is enough to prove that there exists n t = n(n\, d\, d, 2 , k\, e, 5, /, g, i) such that 0 holds, for a 
fixed value of i, 0 < % < d, when n > rii. Then we can take n* = max{rii}. 

We prove this on induction on i. The claim is clearly true for i = 0. Assume, therefore, that the claim 
is true up to z; we will prove it for z + 1 . 

Denote for all j 

pj = Prob [A CPjU.-.U P(j-i)fM UJV 1 U...U N U -i) g (n) U Hj U ... U H d \. 

d d 

To accomplish that it is enough to show that 

Pi+1 >Pi~5, (5) 


By the Bayes formula: 


Pi = Prob [A C P 1 U ... U P(i—i) f (n) U Mi U ... U N ^ 1)g(n) U Hi U ... U H d ] = 

d d 

= Pr\B C Hi\A \ (Pi U ... U P(i-i)/(n) U N± U ... U JV(<-i) 9 (n)) = Pi • 

B d d 

■ Prob [A \(Pi U ... U P (i—i)/ ( n) U Mi U ... U N g-p^n) ) = B] 

d d 

Assume without loss of generality that Hi is a positive hyperplane. 

Let 7 = Let 

C 7 = {B C {0, l} n : Pr[B C P] > 7 }. 

Let a be the sum of those terms in ^corresponding to sets B £ C' 7 , and let /3 be the sum corresponding 
to sets B £ C^y. 

From the definition of C~, it follows that 

0 < P < 7, 

therefore 

Prob [A \ (Pi U ... U P (i-i)/(n) U Ni U ... U N (j-i ) g ( n ) ) £ C 7 ] > 

d d 

> — • [Prob[A C Pi U ... U P(i-i)/(n) U N\ U ... U N (j-i) g („) U Hi U ... U H^] — 7 ] = 

'Y d d 

= ~ ■ \Pi -7]- 

7 


On the other hand 














p i+ 1 = Prob [A C Pi U ... U Pif(n) U JVi U ... U N ig(n) U H i+1 U ... U H d \ = 


"y ^ Pr\B C P (j—\)f(n) \ ^ U 


B 


U Pif(n) U N (i-l)g(n) , 1 U ... U JV ig(n) 
d d ' d 


A \ (P\ U ... U P(i~i)f( n ) U N± U ... U N(i-i) g („)) = B] ■ 

d d 

■ Prob[A \ (Pi U ... U Pg-pfM U Ni U ... U N (i -p gM ) = B] 

d d 


Let f(n) = f(n)/d , g{n) = g{n)/d. Since the iVj’s, P,’s are random hypcrplancs, one can rewrite the 
previous recurrence as 


p i+ 1 = Prob [A C Pi U ... U P if ( n ) U N x U ... U N ig(n) U H i+1 U ... U H d } = 

d d 

= J2^ob[BCP 1 U...UP- m UN 1 U...UN^\ 

B 

A \ (Pi U ... U P(i-i)f(n) U JVi U ... U N (j—x)tj(n) ) = B ] • 

d d 

■ Prob[^4 \ (Pi U ... U P (t—i)f(n) U TVi U ... U N (i - 1)g(n) ) = B} 

d d 

Since all terms are nonnegative, one can obtain a lower bound on the left-hand size of this latter terms 
by only considering those B E C 7 . 

Applying the induction hypothesis from case one for all B e C~, and n > n t = n*(ni, 0,1, k\, h-), 7 . 5. f - Jj), 
we infer that for all such B, 

Prob[-B CP 1 U...U Pj^j U JVi U ... U y] > (1 - 7 ), 

therefore 


Pi+i = Prob [A C Pi U ... U Pif(n) U ATi U ... U N ig(n) U H i+1 U ... U H d }> 

d d 

(1 - 7) • V Pr[A \ (P\ U ... U Pg-I)f(n) U Ni U ... U N (i-i)g(n ) ) = B] 
BeC-, d d 

= (1 — 7) • Pr [A \ (Pi U ... U Pa-i)f(n) U N\ U ... U N g- i) g ( n ) ) € C 7 ] 

d d 

(1 — O') 

> --- • [Prob [A CP1U...U P a-i)f(n) U Ni U ... U N (i - 1)a(n) U Hi U ... U H d ] - 7] = 

'Y d d 


(! -7) 
7 




Since 7 < 1, 


Pi+i > (1 - 7 ) • (Pi ~ 7 ) = Pi - 7 • [1 + Pi - 7 ] > Pi - 27 - 


which is precisely equation 0 (that we wanted to prove). 


□ 




















3.2 How to contradict Lemma I3l2l 


It is now easy to infer the fact that SAT(S) has a sharp threshold, by using the previous le mm a with 
d\ = \F 0 n Var\, d 2 = |To| — d\, k\ = b, hi = a, e = 5o> A = 5$/2 and a refutation of the geometric 
lemma similar to the one for 3-SAT. 

The conclusion of the Geometric Lemma (similar to the one for k-SAT) is that, by adding any number 
f(n) of copies of x\ V ... V x a and g(n) copies of sq V ... V x^ suffices to boost the unsatisfiability 
probability by a constant. 

However, Lemma 3.1 from d asserts that adding up to o(y / (n)) random clauses is not enough to 
boost the unsatisfiability probability by more than o(l). 

Because of the nature of the random model, adding H(n) = o(\/(n)) random clauses insures that 
w.h.p. we have 0 (H (n)) copies of each type of clause, as long as H ( n ) grows faster than some power 
of n. Taking the number of such copies to be the functions f(n), g(n ) contradicts the consequence of 
the Geometric Lemma. 

Note that we do not make use of all the details of the random model (such as the precise number of 
copies of each clause in random instances at p = p{n)), but only that: 

• the expected number of copies of both C\ and C 2 in a random formula at p = pin ) is unbounded. 
This is enough to make the analog of Lemma 3.1 from work. 

• the clauses are independent. 

3.3 Putting it all together 

In the previous section we have proved a geometric lemma that is used to prove that the above-defined 
set SAT(S) has a sharp threshold. 

Consider now a set C of constraints that satisfies the condition (3) of the theorem. Since SAT(C) is 
not trivially satisfied by the “all zero” (all ones) assignment, there exist constraints C\, Co in C and 
a, b > 1 such that C\ |= x\ V ... x a , C 2 \= xT V ... V 5T- In fact a,b> 2, otherwise some constraint 
in C would strongly depend on one variable. 

Just as in the base case, condition (i) in the Friedgut-Bourgain result is eliminated by the result of 
Creignou and Daude, and the formula To in condition (ii) can be assumed to consist of a conjunction 
of unit clauses. Reflecting this fact, the geometric lemma needed for the general case has the same 
hypothesis as the one of Lemma l3~2l the set A can be covered by a union of random hyperplanes of 
codimension 1. However, the covering desired in the conclusion no longer consists of hyperplanes, 
but of (general) sets of points in the hyperplane, corresponding to sets of assignments forbidden by a 
certain constraint C. 

The critical observation (easiest to make in the case when C\ / C 2 ) is that Lemma 13721 implies the 
geometric result we need for this case: since C\ \= x\ V ... x a , the “forbidden set” associated to 
C\ contains the “forbidden set” associated to constraint x\ V ... x a in Lemma 13.21 the hyperplane 
x\ = ... = x a = 0. A similar result holds for C 2 and the positive hyperplanes. 

Thus each “covering set” in a conclusion of the (general case of the ) geometric lemma contains 
a corresponding “covering set” from the conclusion of Lemma 13.21 In other words the geometric 
lemma for SAT(C) follows from the geometric lemma for the base case by monotonicity. 

All is left to show is that the analog of Lemma 3.1 from 03 also works in this general case. We 
have previously observed that this amounts to showing that the expected number of copies of Cj, 
C 2 in a random formula is unbounded. But Proposition 3.5 in ITT! (slightly generalized to probability 


distributions V that are not uniform) and the details of the random model imply that in fact this number 
is linear. 

A simple modification of this argument holds when C\ = C-i. To see this, note that in the proof of the 
fact that Lemma l3~2l holds for some small enough (but unbounded) f(n),g(n) is enough to obtain a 
contradiction. 

The expected number of copies of C\ in a random formula is linear, denote it by h(n). Dividing the 
set of such copies into two (random) sets of cardinality h(n)/2 yields infinitely many random copies 
of C\ used to imply x\ V ... V x a in the previous argument and infinitely many copies used to imply 
~xi V ... V Xb. We then apply the same strategy as in the first case. 

To summarize: the proof follows from the corresponding argument for the base case by monotonicity. 
It critically uses the fact that we are not in cases (i) or (ii) of the Theorem, since it is only under these 
conditions when the first alternative in the Friedgut-Bourgain argument can be eliminated. 


□ 


4 3-SAT has a first-order phase transition 

Theorem 2 k-SAT, k > 3 has a first-order phase transition. In other words there exists q > 0 such that for 
every sequence p = p(n) we have 

lim Prob [<f> E SAT] = 0 lim Prob r \Sp' ne ( — 2 ! > A = 1 . (6) 

n—KX) p=p(n) n—>co p=p( K n) Tl 


Proof. 

We start by giving a simple sufficient condition for a literal to belong to the spine of the formula: 

Claim 2 Let be a minimally unsatisfiable formula, and let x be a literal that appears in <f>. Then x E 
Spine(Q). 

Proof. Let C be a clause that contains x. By the minimal unsatisfiability of ( I>, ( I> \ { C } E SAT. On the 
other hand $ \ {C} A { x } E SAT, otherwise 'I> would also be satisfiable. Thus x E Spine f\> \ { C}). □ 

Thus, to show that 3-SAT has a first-order phase transition it is enough to show that a random unsatisfiable 
formula contains w.h.p. a minimally unsatisfiable subformula containing a linear number of literals. A way 
to accomplish this is by using the two ingredients of the Chvatal-Szemeredi proof IT5ll that random 3-SAT 
has exponential resolution size w.h.p. They are explicitly stated to make the argument self-contained: 

Claim 3 There exists a constant 5 > 0 such that for every constant c > c^sat with high probability (as 
n —» oo) a random formula <I> with n variables and cn clauses has no minimally unsatisfiable subformula of 
size less than 5 ■ n. 

Claim 4 There exists r/ > 0 so that w.h.p. for every c > c^-sat all subformulas of a random formula <f> 
having between (5/2) ■ n and 5 ■ n clauses contain at least q ■ n (pure) literals (corresponding to different 
variables). 

The argument is now transparent: if <t> is unsatisfiable then w.h.p. a minimally unsatisfiable subformula 
E of $ has size at least 5n. By the second claim, applied to an arbitrary subformula of E of size (3<5n)/4, we 
infer that w.h.p. E contains at least many // • n different variables. 




5 First-order phase transitions and resolution complexity of random gener¬ 
alized satisfiability problems 

In this section we extend the previous result (and the connection between first-order phase transitions and 
resolution complexity) to other classes of satisfiability problems. Interestingly, we find that a condition 
Molloy investigated in m is a sufficient condition for the existence of a first-order phase transition. 

It turns out that there are differences between the case of random k- SAT and the general case that force us 
to employ an alternative definition of the spine. The most obvious one is that formula (□> involves negations 
of variables, whereas Molloy’s model does not. But it has more serious problems: consider for example k- 
uniform hypergraph 2-coloring specified as SAT ({Co}), where Cq{x\, ..., xf) has the interpretation “not 
all of x\,... Xk arc equal”. Because of the built-in symmetry to permuting colors 0 and 1 , the spine of 
any instance is empty (under definition [T]). Similar phenomena have appeared before (and forced a different 
definition of the backbone/spine) in k-coloring m (symmetry = permutation of colors) or graph partition 
l2H (symmetry = permutation of sides). There are other ways (to be detailed in the journal version of the 
paper) in which the original definition of the spine behaves differently in the general case than in that of 
random k-S AT. Our solution is to define the concept of spine of a random instance of a satisfiability problem 
SAT(V) in a slightly different way. The definition is consistent with those in 1201 . fTTI . 


Definition 8 Spine(&) ={i£ Var\(3)E C$,5 6 SAT, (3)C E C,x E C, such that HA C £ SAT}. 

It is easy to see that, for A:-CNF formulas whose (original) spine contains at least three literals a variable 
x is in the (new version of the) spine if and only if either x or x were present in the old version. In particular 
the new definition does not change the order of the phase transition of random k-SAT. Moreover the proof of 
Claim Qcarries over to the general case. The resolution complexity of an instance <h of SAT(P) is defined 
as the resolution complexity of the formula obtained by converting each constraint of T to CNF-form. 

Definition 9 Let V be such that SAT(V) has a sharp threshold. Problem SAT(V) has a first-order phase 
transition if there exists // > 0 such that for every sequence p = p(n) we have 

lim Prob [<F E SAT] = 0 =>■ lim Prob J'~^ ne ( —H. > r A = l. ( 7 ) 

n—>oc p=p(n) n—>oc p=p(n) Tl 


If on the other hand, for every epsilon > 0 there exists p e (n) with 


lim Prob [<J? E SAT] = 0 and lim Prob > e ] = q 

n—>oc p=p e (n) n^cop=p(n) n 


( 8 ) 


we say that SAT(T) has a second-order phase transition 1 . 

A first observation is that a second-order phase transition has computational implications: 

Theorem 3 Let V be such SAT{V) has a second-order phase transition. Then for every constant c > 
limn^ocCgATCP) an d every a > 0 , random formulas of constraint density c have w.h.p. resolution 
complexity 0(2 k ' a ' n ). 

Proof. 

By the analog of Claim[2]for the general case, if SAT(V) has a second-order phase transition) then for 
every c > csat(P ) ar| d for every a > 0 , minimally unsatisfiable subformulas of a random formula <I> with 

1 strictly speaking the order of the phase transition is at least two. 






constraint density c have w.h.p. size at most a ■ n. Consider the backtrack tree of the natural DPLL algorithm 
(that tries to satisfies clauses one at a time) on such a minimally unsatisfiable subformula F. By the usual 
correspondence between DPLL trees and resolution complexity (e.g. ffl, PP- 1) it yields a resolution proof 
of the unsatisfiability of $ having size at most 2 k ' a ' n+l . 

□ 


Definition 10 For a formula F define c*(F) = max{ '' Con ^y^Q^ G ^ :I/GC F}. 

The next result gives a sufficient condition for a generalized satisfiability problem to have a first-order 
phase transition. 


Theorem 4 Let C be a set of constraints such that SAT(C ) has a sharp threshold. If there exists e > 0 such 
that for every minimally unsatisfiable formula F it holds that 


c*{F) > 


1 + e 
k- 1 


then for every V with suppfP ) = C 

SAT{V ) has a first-order phase transition. 


Proof. 

We first recall the following concept from ED: 

Definition 11 Let x. y > 0. A k-uniform hypergraph with n vertices is (x,y)-sparsc if every set of s < xn 
vertices contains at most ys edges. 


We also recall Lemma 1 from the same paper. 


Lemma 5.1 Let k,c > 0 and y be such that {k — 1 )y > 1. Then w.h.p. the random k-uniform hypegraph 
with n vertices and cn edges is (x, y)-sparse, where 


e = y-l/(k-l), 
Ze ce 


The critical observation is that the existence of a minimally unsatisfiable formula of size xn and with 
c*(F) > -jAfi implies that the fc-uniform hypergraph associated to the given formula is not (x, y)-sparse, for 

y=kh- 

But, according to Le mm a l5~Tl w.h.p. a random fc-uniform hypergraph with cn edges is (x( h y) sparse, for 
x o = f 27 (^-) y ) 1 ,/f (a dirrect application of Lemma 1 in their paper). We infer that any formula with less than 
.x'o ■ n/K constraints is satisfiable, therefore the same is true for any formula with xq ■ n/K clauses picked 
up from the clausal representation of constraints in T. 

The second condition (expansion of the formula hypergraph) can be proved similarly. 

□ 

One can give an explicitly defined class of satisfiability problems for which the previous result applies: 

Theorem 5 Let V be such that SAT(V) has a sharp threshold. If no clause C G C = suppiV) has an 
implicate of length at most 2 then 




1. for every minimally unsatisfiable formula F 


c*(F) > 


2 

2k - 3' 


Therefore SAT(V ) satisfies the conditions of the previous theorem, i.e. it has a first-order phase 
transition. 


2. Moreover SATlfP ) also has resolution complexity 1 . 


Proof. 


1. For any real r > 1, formula F and set of clauses G C F, define the r-deficiency of G, d r (G) = 
r\Clauses(G)\ - \Vars{G)\. 

Also define 

(5;(F) = max{5 r (G):0/GCF} (9) 

We claim that for any minimally unsatisfiable F, <F >k : ,(F) > 0. Indeed, assume this was not true. 
Then there exists such F such that: 

8 2k -3 (G) < -1 for all 0 + G C F. (10) 


Proposition 1 Let F be a formula for which condition \W\ holds. Then there exists an ordering 
C \,..., C\p\ of constraints in F such that each constraint Ci contains at least k — 2 variables that 
appear in no Cj, j < i. 

Proof. Denote by Vi the number of variables that appear in exactly i constraints of F. We have 

i ■ Vi = k ■ \Constraints(F)\. 

i> 1 

therefore 2\Var{F)\ — v\ < k ■ \Constraints{F)\. This can be rewritten as v\ > 2\Var(F)\ — 
k\Constraints(F)\ > \Constraints{F)\ ■ (2 k — 3 — k) = (k — 3) • \Constraints(F)\ (we have 
used the upper bound on c*(F). Therefore there exists at least one constraint in F with at least k — 2 
variables that arc free in F. We set C|/7 = C and apply this argument recursively to F\C. □ 


Call the k — 2 new variables of Cifree in Ci. Call the other two variables bound in Ci. Let us show 
now that F cannot be minimally unsatisfiable. Construct a satisfying assignment for F incrementally: 
Consider constraint Cj. At most two of the variables in Cj are bound for Cj. Since C has no implicates 
of size at most two, one can set the remaining variables in a way that satisfies Cj. This yields a 
satisfying assignment for F, a contradiction with our assumption that F was minimally unsatisfiable. 

Therefore ^-3 (F) > 0 , a statement equivalent to our conclusion. 

2. To prove the resolution complexity lower bound we use the size-width connection for resolution com¬ 
plexity obtained in fl 6 l : we prove that there exists ij > 0 such that w.h.p. random instances of 
SATfP) having constraint density c have resolution width at least rj ■ n. 

We use the same strategy as in ED 


2 this result subsumes some of the recent results in ca 




(a) prove that w.h.p. minimally unsatisfiable subformulas are “large”. 

(b) prove that any clause implied by a satisfiable formula of “intermediate” size will contain “many” 
literals. 

Indeed, define for a unsatisfiable formula $ and (possibly empty) clause C 

p{C) = min{|S| : E C <b, S |= C}. 

Claim 5 There exists rj 1 > 0 such that for any c > 0, w.h.p. //(□) > t]i ■ n ( where F is a random 
instance of SAT(V) having constraint density c). 

Proof. In the proof of Theorem 0 we have shown that there exists r/o > 0 such that w.h.p. any 
unsatisfiable subformula of a given formula has at least % ■ n constraints. Therefore any formula made 
of clauses in the CNF-representation of constraints in F, and which has less than ?/o • n clauses is 
satisfiable, and the claim follows, by taking r/i = //q. □ 


The only (slightly) nontrivial step of the proof, which critically uses the fact that constraints in V do not 
have implicates of length at most two, is to prove that clause implicates of subformulas of “medium” 
size have “many” variables. Formally: 

Claim 6 There exists d > 0 and r/9 > 0 such that w.h.p., for every clause C such that d/2 ■ n < 
!-i{C) <= dn, \C\ > 772 ■ n. 

Proof. Take 0 < e. It is easy to see that if c*(F) < 2 k- 3 +c then w -h.p. for every subformula G of 
F, at least | • \C onstraints{G)\ have at least k — 2 private variables: Indeed, since c*(G) < 2fc „ 2 3+e , 
by a reasoning similar to the one we made previously fi(G) > (k — 3 + e)\C onstraints(G)\. Since 
constraints in G have arity k, at least e/3 • \C onstraints(G) \ have at least k — 2 “private” variables. 

Choose y = 9fc _ 2 3+c in Lemma I57TI for e > 0 a small enough constant. Since the problem has a sharp 
threshold in the region where the number of clauses is linear, 

d = inf{x(y, c) : c >= c SAT{P) } > 0 . 

W.h.p. all subformulas of $ having size less than d/k ■ n have a formula hypergraph that is ( x,y )- 
sparse, therefore fall under the scope of the previous argument. 

Let E be a subformula of <b, having minimal size, such that S |= C. We claim: 

Claim 7 For every clause P of E with k — 2 “private ” variables, (i.e. one that does not appear in any 
other clause), at least one of these “private” variables appears in C. 

Indeed, supppose there exists a clause D of E such that none of its private variables appears in C. 

Because of the minimality of H there exists an assignment F that satisfies E \ { I )} but does not satisfy 
D or C. Since D has no implicates of size two, there exists an assignment G, that differs from F 
only on the private variables of D, that satisfies E. But since C does not contain any of the private 
variables of D, F coincides with G on variables in C. The conclusion is that G does not satisfy C, 
which contradicts the fact that E 1= C. 





The proof of Claim | 6 ] (and of item 2. of Theorem 13 follows: since for any clause K of one of the 
original constraints p(K) = 1, since //(□) > rji ■ n and since w.l.o.g. 0 < d < rji (otherwise replace 
d with the smaller value) there exists a clause C such that 

p(C) E [d/2k ■ n, d/k ■ n}. (11) 

Indeed, let C' be a clause in the resolution refutation of $ minimal with the property that p(C') > dn. 
Then at least one clause C, of the two involved in deriving C' satisfies equation fill 

By the previous claim it C contains at least one “private” variable from each clause of S. Therefore 
\C\ > r /2 ■ n, with 772 = d/2k ■ e. 


□ 

It is instructive to note that the condition in the theorem is violated (as expected) by random 2-SAT, 
as well as by random 1-in-fc SAT: the formula C(x\, X2 ,..., Xk-u xf) A C(xk, Xk+i, ■ ■ ■, X2k-2, x\) A 
C(x\,X 2 k-\i ■ ■ ■ A C(xk,x 3 fe_ 2 , • • • ,X 4 fc- 4 ,*i) (where C is the constraint “ 1 -in-fc”) is mini¬ 

mally unsatisfiable, but has clause/variable ratio l/(fc — 1) and implicates T\ V ayT and a; 1 V ay.. 

It would be tempting to speculate that whenever both a; V?/ and x V y are implicates of clauses in C then 
SAT{V) has a second-order phase transitions for every distribution V with supp(V ) = C. That is, however, 
not true, at least for some distributions V. Consider the random (2 + p)-SAT model of Monasson et al. (2). 
In this model p is a fixed real in [0,1], A random instances of (2 + p)-SAT with n variables and c • n clauses 
is obtained by choosing pen random clauses of length 3 and (1 — p)cn random clauses of length 2. It was 
shown in El (using the nonrigorous replica method) that 

1. (2 + p)-SAT has a second-order phase transition for 0 < p < pu ~ 0.413.... 

2 . the transition becomes first-order for p > p^. 

3. when the transition changes from second-order to first-order the complexity of a certain DPLL algo¬ 
rithm changes from polynomial to exponential. 

Several rigorous results have complemented these findings. Achlioptas et al. El have shown that for 
()</>< 0.4 the phase transition in (2 +/>)-SAT only depends on the “2-SAT paid”. One can perhaps use the 
techniques of fO to confirm statement (i). 

It is easily seen that (2+p)-SAT can be represented in Molloy’s framework. On the other hand Achlioptas, 
Beame and Molloy [:9l have shown that for those p for which (2 + p)-SAT does not behave like 2-SAT the 
resolution complexity of the problem is exponential. Using Theorem[ 3 ]and and results in 0 we get: 

Theorem 6 Let p E [0,1] be s.t. there exist e > 0 and c < hj—C s.t. random instances of (2 + p)-SAT with 
n variables and c ■ n clauses are w.h.p. unsatisfiable. Then (2 + p)-SAT has a first-order phase transition. 

It would be interesting to obtain a complete characterization of the order of the phase transition in an ar¬ 
bitrary problem SAT(V). Such a characterization, however, requires substantial advances: Exactly locating 
the “tricritical point” po in random 2+p-SAT (or merely deciding whether it is equal or not to 0.4) is an open 
problem. A complete characterization would yield a solution to this problem as a byproduct. 

On the other hand Theorem | 6 | suggests an interesting conjecture: whenever the location of the phase 
transition is not determined by the implicates of size at most two in the given formula, the phase transition in 
SAT(V ) is first-order. Perhaps techniques in (9j can help settle this question. 



6 Discussion 


We have shown that the existence of a first-order phase transition in a random satisfiability problem is often 
correlated with a 2 i}l ' n) peak in the complexity of resolution/DPLL algorithms at the transition point. 

As for the extent of the connection it is easy to see that it does not extend to a substantially larger class 
of algorithms: consider random A-XOR-SAT, the problem of testing the satisfiability of random systems of 
linear equations of size k over Z 2 . A-XOR-SAT is a version of XOR-SAT, one of the polynomial time cases of 
satisfiability from Schaefer’s Dichotomy Theorem ED Indeed, it is easily solved by Gaussian elimination. 
But Ricci-Tersenghi et al. f24l have presented a non-rigorous argument using the replica method that supports 
the existence of a first-order phase transition, and we can show this formally (as a direct consequence of 
Theorem [5j: 

Proposition 2 Random k-XOR-SAT, k > 3, has a first-order phase transition. 

To sum up: the intuitive argument states that a first-order phase transition correlates with a 2 v - tn ' > 
lower bound of the complexity of DPLL algorithms at the transition. This is true in many cases, and 
the underlying reason is that the two phenomena (the jump in the order parameter and the resolu¬ 
tion complexity lower bound) have common causes. However, at least for satisfiability problems, this 
connection does not extend substantially beyond the class of DPLL algorithms. 
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